Reinforcement Learning Produces Dominant Strategies for the Iterated Prisoner's Dilemma

Abstract

We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the full collection of other opponents. The trained strategies and one particular human-designed strategy are also the top performers in noisy tournaments.
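
For readers unfamiliar with the tournament setting the abstract refers to, the following is a minimal sketch of a round-robin Iterated Prisoner's Dilemma tournament. It is not the authors' code: the payoff values (3, 0, 5, 1) are the conventional ones and are an assumption here, since the abstract does not state them, and the three hand-written strategies and the function names `play_match` and `round_robin` are illustrative only. No training (evolutionary or particle swarm) is shown, and there is no noise.

```python
from itertools import combinations

# Assumed payoffs (conventional values, not stated in the abstract):
# maps (move_a, move_b) to (score_a, score_b).
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

# Each illustrative strategy takes (own_history, opponent_history)
# and returns "C" (cooperate) or "D" (defect).
def tit_for_tat(own, opp):
    return opp[-1] if opp else "C"

def always_defect(own, opp):
    return "D"

def always_cooperate(own, opp):
    return "C"

def play_match(strategy_a, strategy_b, turns=10):
    """Play one repeated match and return the two total scores."""
    history_a, history_b = [], []
    score_a = score_b = 0
    for _ in range(turns):
        move_a = strategy_a(history_a, history_b)
        move_b = strategy_b(history_b, history_a)
        pay_a, pay_b = PAYOFFS[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        history_a.append(move_a)
        history_b.append(move_b)
    return score_a, score_b

def round_robin(strategies, turns=10):
    """Play every pair of strategies once and total each one's score."""
    totals = {name: 0 for name in strategies}
    for (name_a, fn_a), (name_b, fn_b) in combinations(strategies.items(), 2):
        score_a, score_b = play_match(fn_a, fn_b, turns)
        totals[name_a] += score_a
        totals[name_b] += score_b
    return totals

if __name__ == "__main__":
    field = {
        "TitForTat": tit_for_tat,
        "AlwaysDefect": always_defect,
        "AlwaysCooperate": always_cooperate,
    }
    print(round_robin(field, turns=10))
```

With only three opponents the unconditional defector comes out ahead, which illustrates why results depend heavily on the opponent pool; the abstract's claim concerns a much larger corpus of over 170 opponents.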

Bibliographic details

Authors: Harper, Marc; Knight, Vincent; Jones, Martin; Koutsovoulos, Georgios; Glynatsi, Nikoleta E.; Campbell, Owen
Year: 2017
Original format: PDF

Similar literature

1. Multiagent Reinforcement Learning: Spiking and Nonspiking Agents in the Iterated Prisoner's Dilemma [J]. Vassiliades V., Cleanthous A., Christodoulou C. Neural Networks, IEEE Transactions on. 2011, no. 4
2. Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma [J]. Masuda N., Nakamura M. Journal of Theoretical Biology. 2011, no. 1
3. An Adaptive Strategy via Reinforcement Learning for the Prisoner's Dilemma Game [J]. Lei Xue, Changyin Sun, Donald Wunsch. 自动化学报：英文版. 2018, no. 001
4. Reinforcement Learning for the N-Persons Iterated Prisoners' Dilemma [C]. Agudo J. Enrique, Fyfe Colin. Computational Intelligence and Security (CIS), 2011 Seventh International Conference on. 2011
5. The Influence of Opponent Strategy and Psychopathic Traits on Point Gains and Cooperation in the Iterated Prisoner's Dilemma [D]. Baggio, Mary. 2018
6. Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma [O]. Marc Harper, Vincent Knight, Martin Jones. 2011
7. Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma [O]. Harper Marc, Knight Vincent, Jones Martin. 2017
